Honey bee protein atlas at organ-level resolution 
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Genome sequencing has provided us with gene lists but cannot tell us where and how their encoded products wori< 
together to support life. Complex organisms rely on differential expression of subsets of genes/ proteins in organs and 
tissues, and, in concert, evolved to their present state as they function together to improve an organism's overall re- 
productive fitness. Proteomics studies of individual organs help us understand their basic functions, but this reductionist 
approach misses the larger context of the whole organism. This problem could be circumvented if all the organs in an 
organism were comprehensively studied by the same methodology and analyzed together. Using honey bees [Apis mellifera 
L.) as a model system, we report here an initial whole proteome of a complex organism, measuring 29 different organ/ 
tissue types among the three honey bee castes: queen, drone, and worker. The data reveal that, e.g., workers have 
a heightened capacity to deal with environmental toxins and queens have a far more robust pheromone detection system 
than their nestmates. The data also suggest that workers altruistically sacrifice not only their own reproductive capacity 
but also their immune potential in favor of their queen. Finally, organ-level resolution of protein expression offers 
a systematic insight into how organs may have developed. 



Honey bees epitomize hard work and evoke thoughts of honey, 
stings, and their telltale black and gold coloration. Apis mellifera L. 
(the western or European honey bee) is one of the few insects that 
humans have domesticated, and now they contribute tens of bil- 
lions of dollars per year to the value of agricultural crops. Bees 
have been dying at unprecedented rates across much of the world 
over the past six years, leading to an increased public awareness 
of their importance. They are one of the few eusocial animals 
known; they are organized into a three-caste system where re- 
production is delegated specifically to the single queen and hun- 
dreds of drones (males), while caring for young, foraging, and 
defense fall to the female workers. Pheromones, especially those 
emitted by the queen, control the physiology and behavior of 
hive members. Despite all hive members having the same ge- 
nome, workers live only for several weeks in the summer, while 
the queen can live for years. While we partially understand how 
bees' social behaviors manifest (Robinson et al. 2008), we have 
little mechanistic understanding of what allows the castes to 
carry out their individual duties. 

Completion of the honey bee genome sequence in 2006 
constituted a quantum leap forward (The Honey Bee Genome Se- 
quencing Consortium 2006) in our molecular understanding of 
this insect. Comparative genomics revealed that many gene fam- 
ilies had expanded (e.g., of genes coding for brood food and 
odorant reception) or contracted (e.g., of immunity and detoxi- 
fication genes) relative to those of Drosophila melanogaster. These 
are merely a few examples of the insight that a sequenced genome 
can provide, in addition to providing a genetic "parts list" that can 
be used by the research community: "with the genome at hand, 
prospects are bright for elucidating the molecular and genetic 
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bases of many complex traits. . (The Honey Bee Genome Se- 
quencing Consortium 2006). While an organism's chromosomes 
can now be sequenced from telomere to telomere, defining the 
complete transcriptome or proteome of any life-form remains be- 
yond our capabilities, especially for multicellular organisms where 
transcriptomes and proteomes will be organ if not cell specific. 
Transcript atlases based on microarrays exist for some species 
(Edwards et al. 2010; Huttlin et al. 2010) and RNA-seq data for 
specific tissues are becoming available (Krupp et al. 2012). For 
multicellular organisms, however, our view of the proteome is 
largely confined to one or a few organs, with just two exceptions: 
The antibody-based Protein Atlas is examining several different 
human proteins (Uhlen et al. 2010), while a very recent report 
has cataloged tissue expression across 28 mouse tissues (Geiger 
et al. 2013). Here we present the protein expression profiles across 
all three castes and across essentially all organs in adult bees. 
These data reveal the remarkable specialization required of the 
proteome to generate distinct organs: major caste-related trends 
observed for enzymes responsible for digestion, immunity, and 
detoxification, to name a few. This represents one of the few 
whole-body, cross-gender, and cross-caste proteome maps for 
any multicellular organism, and is a comprehensive resource 
not only for bee researchers but also in comparative functional 
genomics. 

Results and Discussion 
Honey bee proteins 

We dissected 29 organs from adult worker, drone, and queen 
honey bees, generating biological triplicates of all and pooling 
specimens where necessary to collect enough material. Bees were 
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taken from three different colonieS; all of which are first- or second- 
generation descendants from a single mother queen. Samples were 
analyzed by liquid chromatography mass spectrometry (LC-MS/ 
MS), and all raw spectral data are available in the honey bee 
PeptideAtlas (Chan et al. 2011b) along with all the bee-specific 
peptide spectra our group has published previously, which to date 
represent the largest repository of honey bee fragment mass 
spectra. This freely accessible resource allows the growing com- 
munity of bee mass spectrometrists to plan targeted proteomic 
studies for their protein of interest, a major time-saving improve- 
ment from the trial-and-error work that is usually associated with 
target selection. Furthermore, Gene Ontology annotation by 
BlastZGO has already been performed for each protein hit, and 
other helpful parameters such as peptide hydrophobicity and fre- 
quency of observation. 

In total, 2288 honey bee proteins were identified with at least 
two peptides with a protein false discovery rate of 1.2%. The 
number of identifications initially surprised us, but bees have 
a considerably smaller genome than mammals; even so, we typi- 
cally identify relatively (to number of annotated genes) fewer 
proteins in a bee sample compared with an equivalent amount of 
human protein. This may suggest that there are still many un- 
annotated or incorrectly annotated genes in bee, that the distri- 
bution of protein abundances in bee are more skewed toward lower 
copy numbers, or that there are a large number of polymorphisms 
that are not considered in database searching. Thus, the profiles for 
each organ here are most certainly incomplete, but organ-specific 
patterns can already be distinguished despite the number of proteins 
identified, similar to how organ-specific patterns can also be observed 
with the 100 most abundant proteins in mouse organs (Geiger et al. 
2013). Likewise, while we used every organ we could to maximize 
protein identifications, there were still several identified structures 
that could not be obtained in sufficient quantities to allow effective 
analysis. Another factor that limited the depth of coverage of each 
organ was the difficulties in raising (particularly queens) and dis- 
secting enough bees to obtain sufficient amounts of protein for 
analysis (e.g., thoracic glands and heart). Here we are able to identify 
about 2000 proteins, which is about one-fifth of the predicted 
— 10,000 genes in the bee genome (Elsik et al. 2007), but even still 
there is more that could be learned with more exhaustive sampling 
and improved mass spectrometry technology. 

With the exception of three caste-specific organs (spermatheca, 
testes, and mucus gland), peptides derived from all organs and all 
castes were isotopically labeled for relative quantitation by LC-MS/ 
MS (summarized in Fig. lA). This approach has been widely used by 
us (Chan et al. 2009) and others (Kovanich et al. 2012) and provides 
more accurate data than label-free approaches. This analysis resulted 
in 31,669 individual measurements of relative protein levels, with 
drone or queen levels being measured relative to the worker level in 
all cases. In order to express values in such a way as to allow com- 
parisons across castes without having to always think of the worker 
level as equal to 1.0, the ratios were converted into percent values, 
where the expression levels for a given protein in drone, queen, and 
worker sum to 100%. Classic "housekeeping" proteins such as actin 
(Fig. IB) showed nearly uniform expression across all organs and 
castes. In contrast, the largely female-specific protein vitellogenin 
(gi:58585104) (Fig. IC), particularly important for reproductive 
females (i.e., queens), is clearly most abundant throughout the 
queen's body, with lower levels detected in workers. The data for all 
quantifiable proteins can be browsed and searched in an interactive 
graphical format (http://www.chibi.ubc.ca/faculty/leonard-foster/ 
foster-lab/data/ under Bee Protein Atlas). 



Caste-specific differences in honey bee organs 

The quantitative aspects of the data set presented here cover two 
dimensions of bee biology: (1) across dissectible bee organs and 
(2) across castes. Anywhere from 3% to 16% of the quantified 
proteins in each organ were significantly different among castes 
at the P < 0.05 level (Fig. ID), implying that some organs have 
more caste-specific functions, or specialization, than others. By 
ranking each worker-drone and worker-queen organ according 
to the percentage of proteins that were significantly different 
(Fig. IE) it is possible to visualize how similar the drone or queen 
organ is to the equivalent baseline worker organ. Near the center 
of the wheel (wheel position 1,3-7) are a number of drone organs 
responsible for what would be expected to be caste-independent, 
basic life functions (e.g., heart, eyes). Such similarity was not the 
case for the same comparisons between the worker and queen. 
For example, the drone's mandibular glands (wheel position 1) 
are much more similar to worker than are the queen's (wheel 
position 8). The secretions from the queen's mandibular glands 
have an enormous impact on colony behavior and physiology, 
and the dissimilarity to worker glands may reflect this critical 
specialization. 

Prior to starting this analysis, we had assumed that tissues 
such as the legs would show minimal caste differences. Worker 
hind legs are the only ones to carry pollen sacs, but the basic 
function of all legs in all castes is the same: locomotion. In- 
triguingly, however, the profiles of drone or queen to worker legs 
(wheel positions 21, 32, 36, 40, 45, 48) were actually quite differ- 
ent. Other organs such as the galea (wheel positions 39, 42) and 
flagella (distal portion of the antenna, wheel positions 44, 46) were 
also among the most caste diverse. The most peculiar is the drone 
thoracic salivary gland, the most divergent organ from the worker 
above all others. Very little is known of its function other than to 
moisten food before it enters the alimentary canal. These obser- 
vations hint at more substantial variations in the function and/or 
efficiency of organs among the castes than was previous appre- 
ciated. In humans, there are measurable differences between 
males and females with regard to lipid metabolism (Sugiyama and 
Agellon 2012), the handling of oxidative stress (Vina et al. 2011), 
and the immune system (Oertelt-Prigione 2012), to name a few. 
Hence, it is reasonable that honey bees should have diverse caste 
differences in their organs, and we explore this phenomena in 
more detail below. 

Divergent metabolic capacity of castes 

Males and females of most animal species perform gender-specific 
roles and display gross anatomical differences, but the mecha- 
nisms underlying these differences are ill defined. Queens, drones, 
and workers have distinct dietary needs, which should be reflected 
in their metabolic capacity, but this notion has never been ex- 
plored at the molecular level. The exact composition of their food 
is unknown, but generally workers are fueled by carbohydrates 
from honey, queens rely on lipids and proteins in the form of 
royal jelly fed to them by workers, and drones fall somewhere in 
between (Winston 1987a). After the food travels through the 
esophagus, it proceeds through four major organs in the abdo- 
men before waste is eliminated (Fig. 2A): (1) the crop, or foregut, 
an expandable sac that holds nectar collected from flowers; (2) 
the ventriculus, or midgut, where digestion is mainly thought to 
occur; (3) the small intestine, where nutrients are absorbed; and 
(4) the rectum, or hindgut, which stores waste until it can be 
expelled outside of the hive. 
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Differences in the castes' diets are 
reflected in gut enzymes. Workers ex- 
press threefold more alpha-glucosidases 
(gi:94158854, gi:94400901, gi:58585164) 
and alpha-amylase (gi:58585144) across 
the entire digestive tract (but does not 
meet significance at the P = 0.05 level) 
(Fig. 2B) than queens or drones. A worker's 
diet is rich in carbohydrates, given their 
need for easily accessible energy during 
frequent movement and flight activity. 
On the other hand, drones clearly are bet- 
ter equipped to use protein as an energy 
source, since they expressed higher av- 
erage levels (P < 0.05) of two trypsins 
(gi:328783370, gi: 110762227), two carboxy- 
peptidases (gi:328782267, gi:328782015), 
one chymotrypsin (gi: 110762229), and 
aminopeptidase (gi: 665 12450). Notably, 
the worker midgut appeared compara- 
tively devoid of these enzymes (Fig. 2C). 
To round out the three primary energy 
sources, four lipid binding and trans- 
port proteins (gi:328 780884, gi:328785302, 
gi:328787757, gi:328790232) were more 
highly expressed in the digestive tract of 
queens (Fig. 2D), in keeping with their 
royal jelly diet, which is relatively high in 
lipids (Howe et al. 1985). 

Robust detoxification machinery 
in workers 

Despite identical genomes, bees of each 
caste have very different life spans, rang- 
ing from a few weeks for an adult worker to 
a few years for a queen. These differences 
are thought to be at least partially attrib- 
utable to their varied ability to cope with 
oxidative stress (Finkel and Holbrook 
2000). However, our data indicate that 
bees express several glutathione-5-trans- 
f erases (Fig. 3 A), glutathione peroxidases, 
peroxiredoxins, superoxide dismutases, 
and thioredoxin proteins ubiquitously 
with little or no organ or caste bias (Fig. 
3B). This is not surprising, given that en- 
dogenous metabolic processes generate 
free radicals that can damage proteins, 
lipids, and DNA, and all three castes could 
expect roughly equivalent exposure to 
these. Catalase, which reduces hydrogen 
peroxide to water and oxygen, is heavily 
expressed in workers, followed by drones 
then queens (Fig. 3C). This enzyme is 
concentrated in the worker fat body, con- 
sistent with the liver-like functions of this 
organ. Malpighian tubules are another 
important site for detoxification because 
they selectively remove unneeded metab- 
olites from the hemolymph and sur- 
rounding organs (Dow 2009). Multidrug 
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resistant proteins (MDRPs) are rarely found in most organs, but at 
least five distinct isoforms are expressed in Malpighian tubules. 
More surprising, however, is the queen's uniformly high expression 
of all five of them compared with the other castes (Fig. 3D). 

The enzyme family responsible for detoxification and xeno- 
biotic breakdown is the cytochrome P450 monooxygenases (P450). 
These enzymes are concentrated in organs one might expect to 
be exposed to toxins directly, in the outer extremities, including 
the antennae, legs, and exoskeleton, as well as along parts of the 
digestive tract (Fig. 3E). Of the three castes, workers spend by far 
the most time outside the hive so would be most likely to en- 
counter insecticides, defensive phytochemicals, and pollutants; 
indeed, they express significantly higher levels of P450s compared 
with queens and drones (Fig. 3F). 

Our data do not support a link between detoxification en- 
zymes and life span. A queen's longevity cannot be solely attrib- 
uted to her heightened expression of MDRPs since workers are 
enzymatically better equipped to minimize the effect of xenobi- 
otics but their life span remains relatively short. In keeping with 
the principles of natural selection, bees have evolved such that 
each caste can neutralize the toxins their bodies typically generate 
or encounter. 



broad functions, such as maintaining hive unity. All other OBPs 
demonstrated a strong organ bias and typically were highest in 
queens (Fig. 4E,F,H). 

The overall observation that queens express the most OBPs 
seems to correlate inversely with the number of olfactory organs, 
the sensilla placodea, on the antenna surface (queens have —1600, 
workers -3000, and drones -30,000) (Winston 1987b). One hy- 
pothesis to explain the inverse correlation between OBP variety 
and sensilla placodea is that drones may have the most acute sense 
of smell, but only to very few pheromones. Drones do not have 
many functions, but they do have to be able to track down a virgin 
queen in flight to mate with her. In contrast, the queen is sur- 
rounded at all times by tens of thousands of workers and larvae, 
each emitting pheromones, so she may have evolved to have fewer 
antennal pores to avoid being overwhelmed. Her heightened ex- 
pression of many different OBPs may indicate that she is capable 
of sensing numerous types of odorant signals. It implies that she 
is also capable of responding to cues from workers and other 
sources, in addition to being the chief pheromone producer in 
the colony. 



Queens may have the best sense of smell 

Scent is a vital means of communication among bees, with pher- 
omones affecting the behavior and physiology of every member in 
the hive. The largely insoluble odorant molecules used by bees are 
carried through the aqueous lymph by soluble odorant binding 
proteins (OBPs). Despite the importance of OBPs, their specific 
substrates are unknown, although several expression studies exist 
for some organs and developmental stages (Foret and Maleszka 
2006; Dani et al. 2010). OBPs are expressed in nearly all regions of 
the body but are most concentrated in tissues exposed to the ex- 
ternal environment (Fig. 4A). Antennae have the richest variety of 
OBPs, expressing at least 18, plus two chemosensory proteins 
(CSPs), while queens had more than other castes, shown most 
clearly in the short proximal segment of the antenna called the 
scape (P = 0.038 and P = 0.093 compared with the drone and 
worker, respectively) (Fig. 4B). 

Expression levels of OBPs were roughly in-line with a semi- 
quantitative analysis of transcript abundance (Foret and Maleszka 
2006), especially with regard to the widespread presence of OBP14 
(Fig. 4C). This protein and CSP3 (Fig. 4D) were the most ubiquitous 
and most evenly distributed among all the castes, suggesting that 
they may share a ligand and/or that their ligands may have very 



Figure 1. An overview of the Honey Bee Protein Atlas. (A) Sample handling flowchart. Worker pro- 
teins from each organ served as a reference against which queen and drone organs were compared. The 
relative abundance of actin (B) and vitellogenin (C) in the organs of each caste are shown on a grayscale. 
The absence of an organ indicates that either the organ does not exist (e.g., stinger in drones) or the 
protein was not detected there. Whole-body averages are shown in parentheses; (**) P < 0.01 . (D) A 
diagrammatic representation of each tissue, and the percentage of quantified proteins in queens and/or 
drones whose expression level is significantly different from the worker at P < 0.05. Individual per- 
centage differences in queen-worker (red squares) and drone-worker (blue squares) comparisons for 
the tissues are shown as a radar graph (0% at the origin up to 1 5%) in E, arranged from the least different 
(wheel position 1) to most different (wheel position 49) tissue, (anf) flagellum (antenna); (ans) scape 
(antenna); (brn) brain; (crp) crop (forgut); (ftb) fat body; (gal) galea (mouthpart); (glo) glossa 
(mouthpart); (hrt) heart; (hyp) hypopharngeal gland (queen and worker only); (int) intestine; (Igf) front 
leg; (Igm) middle leg; (Igr) rear leg; (mal) Malpighian tubules; (man) mandibular gland; (mus) thorax 
muscle; (nrv) nerve; (poi) poison sac (queen and worker only); (rec) rectum; (sic) post-cerebral salivary 
gland; (sit) thoracic salivary gland; (stg) sting (queen and worker only); (stn) sternite; (ter) tergite; (ven) 
ventriculus (midgut). 



Queen immune system linked to longevity 

The recent unsustainable losses of managed bee colonies to in- 
fectious diseases (Evans and Schwarz 2011) underlines our poor 
understanding of innate immunity in bees. Initial indications 
from the genome sequence suggested that bees have far fewer 
immunity genes than other insects (The Honey Bee Genome 
Sequencing Consortium 2006), and our data suggest that even 
the few genes that the species does have are not protecting all 
three castes equally. The major melanization pathway enzyme, 
prophenoloxidase (PPO, gi:58585196), was nearly ubiquitous 
across organs but was highly biased toward queens, followed by 
workers and then drones (Fig. 5A). Interestingly, the only known 
enzyme that can activate PPO (gi:328 783409) was expressed solely 
in the heart of queens and workers (81%-19%, P < 0.05). 

This queen-biased expression also holds true for two patho- 
gen-binding proteins that were quantified: Peptidoglycan recog- 
nition protein S3 (PGRP-S3, gi:254910928) (Fig. 5B) and Gram- 
negative bacteria-binding protein 1-2 (GNBP 1-2, gi:254911140) 
(Fig. 5C). Trends for antimicrobial peptides were not as clear-cut; 
e.g., hymenoptaecin (gi: 58585174) (Fig. 5D), trended toward 
higher expression in queens but was not statistically significant. 
On the other hand, workers express much more defensin than 
other castes (Fig. 5E), although this is almost exclusively in the 
head and thoracic glands, as well as the hypopharyngeal glands 
which synthesize royal jelly for larvae 
and queens to consume. As such, the 
defensin workers do produce is probably 
not used for their own protection but, in 
true altruistic fashion, goes toward pro- 
tecting their queen. Overall, queens ap- 
pear to have the most robust immune 
system, even to the point of the other 
castes being completely expendable. 



Queen versus worker venom 

The stinger apparatus (i.e., the poison 
sac plus the stinger) is a bee's primary 
weapon and is exclusive to the female 
castes. Workers use it to defend the col- 
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Figure 2. Caste-dependent differences in major digestive system en- 
zymes. The abdominal sections of the digestive tract. Relative expression 
values of major enzymes for digestion of mono- and polysaccharides 
(B), proteins (C), and lipid transport (D) are shown on a grayscale. Aver- 
ages among the four organs are shown in parentheses; (*) P < 0.05, (**) 
P<0.01. 



ony, using a ratchet mechanism in the barbed stingers to help it 
initially lodge in and then v\^ork its way further into thick tissues, even 
as it means death for the v^^orker. Queens use their barbless stingers 
exclusively to kill rival queens. Previous analyses of venom composi- 
tion have identified certain proteins specific to v^^orker or queen 
venom, as might be expected given the contrasting end goals of their 
stinging behavior. The differences between queen and v^^orker venom 
are even more dramatic than previously thought, however. Phos- 
pholipase A2 (gi:58585172) is a v^^ell-knov^^n venom protein, and it is 
heavily expressed in the v^^orker throughout the body (Fig. 6A) and 
particularly in the stinger and poison sac (Fig. 6C; and as shov\^n 
before, Marz et al. 1981). However, a homolog (gi:l 10758297, 49% 
identical) is more abundant in the queen stinger (Fig. 6D). Differ- 
ences in their signal peptide and propeptide regions (Fig. 6B) suggest 
they are sorted and regulated differently. Queens also have more 
lipocortin (gi:328790767, also called annexin) (Fig. 6D), a known 
suppressor of phospholipase A2 activity. On the other hand, 
workers express a stinger-specific trypsin (gi: 328778042) (Fig. 
6C), which may cleave the phospholipase A2 propeptide with 
arginine at the C terminus. 

Melittin (gi:58585154), a pore-forming peptide that causes 
cell leakage (Raghuraman and Chattopadhyay 2007), is already 
knovm to make up 50% of the worker venom by weight (Habermann 
1972). In our data, workers have > 1000-fold more melittin than 
queens in the stinger (Fig. 6C). Given that it can also synergistically 



activate phospholipase A2, leading to release of pro-inflammatory 
arachidonic acid in the victim, it appears that worker venom 
is designed to cause quick and localized damage. Further- 
more, the extracellular matrix-degrading enzyme hyaluronidase 
(gi:58585182), the hydrogen peroxide-producing glucose oxidase 
(gi:58585090), and the lysosomal lipase-activating enzyme 
saposin (gi:328 782499) are all much higher in the worker poison 
sac and stinger (Fig. 6C). Given that workers need to defend the 
colony against predators that are often hundreds of thousands of 
times their own size (e.g., bears, humans), killing the antagonist 
would be a tall order, but inflicting enough pain to drive it away 
achieves the same goal. 

On the other hand, queen venom is used exclusively to kill rival 
queens. Queen venom contains higher levels of an apoptosis-in- 
ducing factor (gi:328783113), a proteasome activator (gi:328788437), 
and several peptidases (Fig. 6C), including a trypsin-like enzyme 
with unknown substrates (gi:58585116) (Georgieva et al. 2010), 
better known as the venom allergen Api m7. Thus, queen venom 
appears to be optimally suited to damaging cells and tissues. Even the 
anti-coagulant anti-thrombin III (gi:328782084) (Fig. 6C) helps to 
ensure that this toxic concoction circulates thoroughly in the victim 
queen's body. 

Insights into organ differentiation 

The body of data generated here can be used to explore how the 
proteomes among the castes differ, but just as interesting is its 
capacity to help us understand how each organ is similar or di- 
vergent from one another. To this end, we identified the proteins 
for each organ that were in approximately equal levels among all 
three castes (minimum 20% in each, thereby excluding heavily 
caste-specific proteins), averaged the number of peptides we were 
able to find for each protein between the worker-drone and 
worker-queen samples, and divided this value by the total number 
of peptides found in a given organ; this effectively corrected for the 
incongruent numbers of peptides (and proteins) found in each 
organ, and with these values we performed complete linkage 
clustering for the top 500 most abundant proteins (Fig. 7A). 

Numerous insights emerged from this analysis, but first we 
observed that some patterns are consistent with existing knowl- 
edge. For example, the front, middle, and rear legs have effectively 
identical functions (i.e., locomotion) and formed the tightest 
cluster. Other organs that have similar functions also have similar 
expression patterns, such as the major exoskeletal plates of the 
abdomen (ternite, sternite) and the mouthparts (galea, glossae). 
Not everything is so obvious, however. The nerve chord and the 
short antennal base (scape) are quite similar, perhaps related to 
their roles in transducing nerve signals. The small intestine, which 
is poorly understood in the bee, may have similar functions to the 
Malpighian tubules, which play a major role in osmoregulation 
and ion transport. Due to the physical proximity of these two or- 
gans, cross-contamination during dissection cannot be ruled out, 
but is unlikely to have had a major contribution since the two 
organs are morphologically quite different and easy to distinguish 
under a dissecting microscope. The heart and thoracic salivary 
gland are physically a good distance apart so cross-contamination 
cannot be a factor, yet the two form a tight cluster. This observation 
cannot be reasonably explained at this point — both are thought to 
be highly dissimilar (anatomically and physiologically) but are also 
not well studied. 

As far as we can ascertain, this clustering approach has never 
been attempted before for a large number of organs for a whole 
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ganism organ-level protein expression 
profiles become available will its validity 
be established, but at least the data set 
presented here allows us to systematically 
address how organs may have evolved, at 
least in insects. 
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Figure 3. Caste and organ distribution of detoxification enzymes. (A) GST proteins as a fraction of all 
quantified proteins across castes. (B) Peroxiredoxin (gi:328787790) and (C) catalase expression levels. 
Whole-body averages are shown in parentheses; (*) P < 0.05, (**) P < 0.01. (D) In the Malpighian 
tubules, MDRPs were collectively highly expressed in queens (red) compared with drones (blue) and 
workers (yellow). The size of each slice is determined by the relative proportion of a given protein in each 
caste; (*) P < 0.05 compared with the worker. A (gi:328789595), B (gi:3287871 48), C (gi:3287841 75), 
D (gi:328791 429), E (gi:328777607). (f ) P450 proteins as a fraction of all proteins across castes, and (f ) 
among the P450s, a caste breakdown of their average P450 expression (±SEM, n = 85). 



organism, likely because no comprehensive assessment of gene 
expression across organs has been attempted. The relationships 
among organs shown in Figure 7 A is obviously correlated to their 
individual functions. Visually, it echoes the dendrograms shown 
for the evolution of gene families, and for that reason, it may be 
possible to use these diagrams to draw parallels to organ evolution. 
When bilaterians first appeared in evolution, they would have had 
only one or two rudimentary organs, such as the precursor to the 
vertebrate heart and the dorsal vessel of arthropods (Xavier-Neto 
et al. 2007), so the organ systems of complex animals must have 
been the result of several divergent steps in evolution, much the 
same way as one gene might duplicate and then each copy evolves 
independently. Such a process is suggested in the organization of 
various animals that we see today, but there is little evidence of this 
in the fossil record since soft tissues are very poorly preserved. 
However, given that sequence homology between genes implies 
a common evolutionary origin (Koonin 2005), expression simi- 
larity at the protein level of different organs should reflect their 
ancestry. However, the dendrogram in Figure 7 A suggests an evo- 
lutionary path each organ may have followed from an undefined 
common ancestral organ to get to its modern state. While all the 
organs may have evolved from a single ancestor, it seems more 
likely that independent organogenesis events probably occurred 
throughout history. If so, then depicting all the organs as belong- 
ing to a single supercluster would be artificial; the real situation 
may have been more like that presented in Figure 7B. Clearly this is 
an entirely speculative interpretation of the data, and to the best of 
our knowledge, such a molecular approach to define organ evo- 
lution is completely novel, even among the vast microarray liter- 
ature. The lack of an appropriate fossil record of arthropod organs, 
or for any other phyla, makes verification of the relationships 
implied in Figure 7, A and B, impossible. Only as more pan-or- 



Conclusion 

Of the three levels of information in the 
Central Dogma, proteins are most closely 
linked to phenotype. Mass spectrometry- 
based proteomics thus enables a func- 
tional genomics view of gene expression 
in a tissue and how that may be linked to 
the function(s) of that tissue. Despite 
having nearly complete genomes for 
many multicellular organisms, no com- 
prehensive, organ/tissue-level proteomic 
description of a multicellular organism 
has been attempted previously, although 
the 28 mouse tissues reported by Geiger 
et al. (2013) comes closest. The 29 organs 
that we could obtain in sufficient quan- 
tities to analyze have enabled us to con- 
struct a detailed protein expression atlas 
for the adult stage of all three castes. The 
data are generally consistent with known 
bee biology but have revealed several 
novel insights into specific metabolic and 
sensory capabilities of individual organs and of specific castes. In 
this article we have merely selected a few highlights to discuss, but 
the vast wealth of data pertaining to other aspects of bee proteins 
function are available online in a searchable format that includes 
illustrated caste and organ comparisons for all quantifiable proteins 
(http://www.chibi.ubc.ca/faculty/leonard-foster/foster-lab/data/ 
under Bee Protein Atlas). This resource should serve as a catalyst 
for studying the finer details of biological regulation, such as post- 
translational modifications and protein-protein interactions. 

Perhaps the most profound and novel finding here is the first 
hint of how animal organs may have evolved and, indeed, even the 
realization that questions along this line can be answered. Com- 
parative genomics helps us to understand the evolutionary re- 
lationship between organisms. The fossil record is also useful in 
this regard and can explain how certain structures like limbs 
evolved, but neither fossils nor genomics are useful for softer 
structures. By looking at protein expression profiles across many 
organs, however, a picture of which individual organs may share 
common ancestral structures begins to emerge. As with fossils and 
genomes, however, data from a single organism are not sufficient 
to build an accurate picture; our understanding of how organs may 
have evolved will become clearer when data from multiple species 
become available. 



Methods 

Reagents 

All salts and chemicals were of analytical grade or better and were 
obtained from Sigma-Aldrich unless otherwise indicated. All sol- 
vents were of HPLC grade and were obtained from ThermoFisher 
Scientific. The following materials were obtained as indicated: 
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Figure 4. Caste and organ distribution of OBPs. (A) OBPs as a fraction of 
all quantified proteins across castes. (B) The percent expression of all 
quantified OBPs in each caste, averaged across the flagellum (the long 
distal portion of the antenna, n = 16) and the scape (the short proximal 
portion, n = 1 1 ) (±SEM). The relative abundance of OBP1 4 (C), CSP3 (D), 
OBPS (f), OBPl 7 (0, OBP18 (C), and OBP21 (H) in the organs of each 
caste is shown on a grayscale. Whole-body averages are shown in pa- 
rentheses; (*) P < 0.05, (**) P < 0.01 . 



endopeptidase Lys-C, Wako Chemicals; porcine-modified trypsin, 
Promega; loose ReproSil-Pur 120 C18-AQ 3 [xm, Dr. Maisch 
(Ammerbuch-Entringen, Germany); 96-well full skirt PGR plates, 
Axygen; fused silica capillary tubing, Polymicro; soft forceps for 
holding bees, BioQuip; protease inhibitor mixture, Roche Applied 
Science. 



Apiculture and bee dissection 

A randomly selected, naturally mated queen was initially chosen 
as the initial source of all bee organ. This queen's colony and all 
her progeny colonies were maintained at an apiary in Langley, 
British Golumbia from 2007 to 2010, during which time adult 



bees were collected as needed. Dissections were all performed as 
described (Dade 1994), and all organs were washed with phos- 
phate-buffered saline (PBS) three times and frozen until proteins 
could be extracted. Three biological replicates of each organ were 
dissected, and material from multiple animals (up to 30 in some 
cases) was pooled to obtain enough protein material for each 
replicate. 



Protein extraction and digestion 

Exoskeleton-containing hard tissues were homogenized in a bead 
mill using a single tungsten bead in each 2-mL self-locking tube 
(Eppendorf ). Tissues were pulsed at 30 H for 5 min in 50 |jlL of PBS 
containing a protease inhibitor cocktail (Roche) at 8x the sug- 
gested concentration. Soft tissues were placed on a glass plate and 
cut into smaller pieces before collecting them in a microfuge tube 
containing 50 |jlL of PBS. All samples were centrifuged for 10 min at 
16,100 relative centrifugal force (RGF) at 4°G to pellet cells and 
debris, and the supernatant was removed. Lysis buffer (100 fxL of 
1% NP-40, 150 mM NaGl, 20 mM Tris at pH 7) was added to the 
pellet, and the contents were passed through a 25 G needle 10 
times. The sample was clarified for 10 min at 16,100 RGF at 4°G, 
and the pelleted debris was discarded. Proteins were precipitated 
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Figure 5. Caste and organ distribution of immunity proteins. (4) PPO, 
(S) PGRP-S3, and (C) GNBP 1 -2 expression is significantly higher in queens 
compared with the other castes. Overall expression levels of the antimi- 
crobial peptides hymenoptaecin (D) and defensin (f) are not different {P > 
0.1 ), except that drones express far less defensin than queens or workers. 
Relative levels across each organ shown on a grayscale. Whole-body av- 
erages are shown in parentheses; (*) P < 0.05, (**) P < 0.01 . 
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Figure 6. Proteome of the poison and sting apparatus: select proteins. (4) Relative abundance of phospholipase A2 (gi:585851 72) in the organs of each 
caste is shown on a grayscale, and the whole-body average is shown in parentheses. Worker expression is significantly different from the others; (**) ? < 
0.01). (S) Multiple sequence alignment of <1> phospholipase A2 and <2> phospholipase A2-like (gi:l 10758297). The signal peptide (gray) and pro- 
peptide (underlined) are as given in Kuchler et al. (1 989). Select proteins that tend to be more highly expressed in (C) workers (white bars) and (D) queens 
(gray bars) in the poison sac (P) and stinger (S) are shown; (*) ? < 0.05, (#) the protease also found in the digestive tract. 



for 3 h through the addition of 1 mL of ethanol (100%) with 20 |jlL 
of 3 M sodium acetate (pH 5) and 20 fxg of glycogen. Protein pellets 
were collected by centrifugation for 10 min at 16,100 RCF, briefly 
dried for 5 min in a vacuum centrifuge, and resolubilized in as 
little solubilization buffer (6 M urea, 2 M thiourea in 10 mM 
HEPES at pH 8) as was required to fully resolubilize the pellet 
(as low as 2 |jlL). The Coomassie Plus Protein Assay (Pierce) was 
used to determine protein concentrations of the tissue lysates 
according to the manufacturer's instructions before they were 
stored at -20°C until downstream analysis. Proteins were then 
digested as described (Chan et al. 2011a); desalted, isotopically 
labeled by dimethylation, and desalted (Chan and Foster 2008). 
Labeled proteins were mixed prior to the second desalting, with 
the worker sample acting as the reference in mixtures of worker- 
queen, worker-drone, and worker-queen-drone. Caste-specific 
organs (i.e., reproductive organs) were analyzed without isotopic 
labeling. Samples that contained a total of 7 |jLg of protein or more 
were analyzed in the mass spectrometer in duplicate, or step- 
gradient-fractionated by strong cation exchange STAGE tips 
(Ishihama et al. 2006). Liquid chromatography-tandem mass 
spectrometry (LC-MS/MS) was conducted as described (Chan 



et al. 2011a) using a 1100 Series nanoflow high-performance 
liquid chromatography system (Agilent) with a CI 8 column, di- 
rectly coupled through a nanoelectrospray ion source (Proxeon) 
to a linear trapping quadrupole Orbitrap XL (LTQ Orbitrap XL, 
Thermo Scientific). 

Data processing 

Fragment spectra were searched against the Apis Official Gene 
Set (v2, 22,037 sequences) using MaxQuant (vl.2.0.13) (Cox 
and Mann 2008) with default parameters. Lys and N-terminal 
dimethylation with the three isotopologs of dimethyl groups 
were considered, but otherwise default parameters were used for 
both identification and quantitation. For a protein to be con- 
sidered identified, two or more unique peptides were required. 
Quantitative data extracted by MaxQuant were further processed by 
Perseus, converting peak volume data for all isotope labels into 
natural logarithm-transformed ratios of Worker/Drone (W/D) and 
Worker/ Queen (W/Q). Statistical significance for these relative 
quantitation values were obtained by calculating Significance B in 
the Perseus module of MaxQuant (Cox and Mann 2008) at P < 
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Figure 7. Evidenced -based insight into organ differentiation and specialization within castes. Den- 
drograms generated from the top 500 most common proteins among the tested tissues propose two 
models of organogenesis: (A) a common, single ancestral organ, or (B) multiple organogenesis events. 
The time scale and its linearity are uncertain but likely go back to the appearance of early bilaterans. 
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0.05, after correction for multiple hypothesis testing. The W/D and 
W/Q ratios, where available for each protein, are converted into 
percent values, with the total expression of W + D + Q assigned as 
100%. These percent expression values are represented on an 
inverted grayscale (i.e., 100% expression is converted to 0 or black, 
and 0% expression is converted to 100 or white) for each organ of 
each caste (e.g.. Fig. IB). Missing organs in the diagram indicate 
that the organ does not exist or that the protein did not meet the 
limit of quantitation there. Whole-body expression values for each 
protein are calculated by averaging the percent expression values 
across each organs for each caste (each organ treated as one n), and 
two-tailed f-tests were used to evaluate whether a caste is signifi- 
cantly different from the other two. For the digestive system, 
similar calculations were conducted to generate the diagram and 
expression values, but only limited to carbohydrate and peptide 
degradation and lipid transport proteins (see the body of the text 
for their identities) and only to the crop, ventriculus, small in- 
testine, and rectum. 

For particular protein families of interest, the numbers of 
isoforms expressed in each organ that fall within the limit of 
quantitation were added (regardless of caste) and divided by the 
total number of quantified proteins in that organ (as seen in Figs. 
4A,E, 5A). For Figure 4E, each measurement of each P450 subtype 
level across each organ where detectable across all three castes is 
considered one n, for a total of n = 85. For Figure 5 A, all detectable 
OBPs or CSPs were considered in the average values as one n, 
equating to n = 16 for the flagellum and n = 11 for the scape. 

Data access 

The mass spectrometry proteomics raw data have been deposited 
in the ProteomeXchange Consortium (http://proteomecentral. 
proteomexchange.org) via the PRIDE partner repository (Vizcaino 
et al. 2010) with the data set identifier PXD000084. The spectra are 
publicly available at PeptideAtlas (http://www.peptideatlas.org/ 
builds/honeybee/). Interpreted data for all quantifiable proteins 
can be browsed and searched in an interactive graphical format 
(http://www.chibi.ubc.ca/faculty/leonard-foster/foster-lab/data/ 
under Bee Protein Atlas). 
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